Method and apparatus for dynamically managing bandwidth for clients in a storage area network

ABSTRACT

A method for managing bandwidth allocation in a storage area network includes receiving a plurality of Input/Output (I/O) requests from a plurality of client devices, determining a priority of each of the client devices relative to other client devices, and dynamically allocating bandwidth resources to each client device based on the priority assigned to that client device.

FIELD OF THE INVENTION

[0001] The present invention relates to the field of data storage. More particularly, the present invention relates to dynamically restricting the bandwidth of one I/O path in a storage area network to provide additional bandwidth to another I/O path.

BACKGROUND OF THE INVENTION

[0002] The use of computers and computer networks pervade virtually every business and other enterprise in the modern world. With computers, users generate vast quantities of data that can be stored for a variety of purposes. This storehouse of data can grow at a phenomenal pace and become critically valuable to those who have generated it. Consequently, there is an ever-present need for data storage systems that improve on capacity, speed, reliability, etc.

[0003] In a single computer, the primary data storage device is usually a hard drive with a storage capacity measured in gigabytes. Additionally, computers may store data using such devices as CD-ROM drives, floppy disk drives, tape drive, etc. Within a computer network, the computers of the network may also store data on network servers or other data storage devices, such as those mentioned above, that are accessible through the network. For larger systems with even greater data storage needs, arrays of data storage disks may be added to the network.

[0004] Storage Area Networks (SANs) are an emerging technology being implemented to accommodate high-capacity data storage devices, particularly disk arrays, within a network. A SAN is essentially a high-speed network between client devices, such as servers and data storage devices, particularly disk arrays. A SAN overcomes the limitations and inflexibility of traditional attached data storage.

[0005] A SAN can overcome limitations of traditional attached data storage but also introduces new considerations. In particular, SANs experience competition for resources when more than one server is attempting to access the same data storage device. A typical storage device has a limited amount of bandwidth in its Input/Output (I/O) paths, and this bandwidth must be shared by the clients accessing the storage device.

SUMMARY OF THE INVENTION

[0006] In one of many possible embodiments, the present invention provides a method for managing bandwidth allocation in a storage area network that receiving a plurality of Input/Output (I/O) requests from a plurality of client devices, determining a priority of each of the client devices relative to other client devices, and dynamically allocating bandwidth resources to each client device based on the priority assigned to that client device.

BRIEF DESCRIPTION OF THE DRAWINGS

[0007] The accompanying drawings illustrate various embodiments of the present invention and are a part of the specification. Together with the following description, the drawings demonstrate and explain the principles of the present invention. The illustrated embodiments are examples of the present invention and do not limit the scope of the invention.

[0008]FIG. 1 is a block diagram illustrating an embodiment of a system according to principles of the present invention.

[0009]FIG. 2 is a block diagram illustrating an additional embodiment of a system according to principles of the present invention.

[0010]FIG. 3 is a block diagram illustrating an additional embodiment of a system according to principles of the present invention.

[0011]FIG. 4 is a block diagram of a system according to one embodiment of the present invention.

[0012]FIG. 5 is a flow diagram for assigning array performance groups according to principles of one embodiment of the present invention.

[0013] Throughout the drawings, identical reference numbers designate similar, but not necessarily identical, elements.

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION

[0014] Embodiments of the invention include a method for managing bandwidth associated with a Storage Area Network (SAN). According to one exemplary embodiment, described more fully below, an innovative method limits the bandwidth associated with one I/O path so that a different I/O path may consume the extra bandwidth. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention can be practiced without these specific details.

[0015] Reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The several appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.

[0016] Example Overall Structure

[0017] Storage area networks vary in size and complexity, and are flexible in their configurations for meeting the storage needs of a network. A simplified storage area network configuration is depicted in FIG. 1 to illustrate the transfer of data between a limited number of devices interfaced with a storage area network. More complex storage area networks may interface with any number of devices as needed to meet a given user's storage needs.

[0018]FIG. 1 illustrates a data retrieval system according to one embodiment of the present invention. As shown in FIG. 1, an embodiment of a data retrieval system includes a number of servers or host computers (100, 110), referred to collectively as “clients.” As demonstrated in FIG. 1, data retrieval systems may assign a different priority to each client within the data retrieval system, as illustrated by the priority server (100) and the non-priority server (110). Each server is communicatively coupled to a Host Bus Adapter (HBA) (115) which is in turn coupled to a communication line (120).

[0019] The communication line (120) that couples the servers (100, 110) to the storage disk array (150) is preferably a fibre channel loop compliant with the “Fibre Channel Physical and Signaling Interface” ((FC-PH) Rev. 4.3, X3T11, Jun. 1, 1994 standard, American National Standards for Information Systems), which standard is hereby incorporated by reference. Each device on the loop (120), by virtue of the fiber channel host bus adapter, has a unique identifier referred to as its world wide name (WWN). The present invention may also use any unique identifier associated with the servers (100, 110) so long as that identifying means is unique for each device among the interconnected devices.

[0020] Continuing in the direction of the communication line (120), the line (120) is fed into a fibre channel switch (130). The switch (130) continues on to a port (140) of the storage disk array (150).

[0021] In computing systems, storage disk arrays (150) divide the storage into a number of logical volumes. These volumes are accessed through a logical unit number (LUN) (155) addressing scheme as is common in SCSI protocol based storage systems, including SCSI protocol based, fibre channel loop, physical layer configurations. The term LUN refers to a logical unit or logical volume, or, in the context of a SCSI protocol based device or system, to an SCSI logical unit or SCSI logical volume.

[0022] Those of ordinary skill in the art will appreciate that the number of physical disk drives may be the same as, or different from, the number of logical drives or logical volumes. However, for the sake of simplicity and clarity, we use these terms interchangeably here, focusing primarily on logical volumes as compared to the physical disk drives that make up those logical volumes.

[0023] The storage disk array (150) also contains a resource manager (160). The resource manager (160) contains firmware that enables the resource manager (160) to identify each server (100, 110) accessing the storage array (150) and to allot I/O bandwidth at the port (140) to each such server (100, 110) as specified by the firmware.

[0024]FIG. 2 illustrates an additional configuration of one embodiment of the present invention. As shown in FIG. 2, a number of servers (200, 210) may be connected via a fibre channel loop (220) to a plurality of FC switches (230) leading to a plurality of storage disk arrays (250) that are communicatively coupled to the network through the switches (230). It will be appreciated by those of ordinary skill in the art that the present invention may be practiced with a number of configurations without varying from the teachings of the present invention.

[0025] Exemplary Implementation and Operation

[0026] As mentioned earlier, SANs experience competition for resources when more than one client is attempting to access the same data storage device. A typical storage device has a limited amount of bandwidth in its I/O paths and this bandwidth should be properly apportioned out to the clients accessing the storage device. An I/O path is the path from the client's Host Bus Adapter (HBA), over a Storage Network, to a block of storage on a storage device (e.g. a disk array) (250). In order to properly allocate the bandwidth, the system recognizes that the host systems are not all of the same priority, i.e., in order to optimize the operation of the overall system, some clients or servers need more I/O performance and bandwidth from the storage devices (250) than do other clients. In order to maximize the performance of the system, the maximum storage device performance available for lower priority client systems shouldn't impact the storage device performance available to higher priority clients.

[0027] The resource manager (260) is a product that monitors the I/O performance and bandwidth usage of the storage system, and sets performance caps based on user-established policies. One aspect of the present invention concerns the ability to set an upper limit or cap and a minimum threshold on bandwidth usage. The cap limits the amount of bandwidth a client may use at any one time. The minimum threshold establishes a minimum level of performance below which the user-defined policies, i.e., the caps, are relaxed. There are a number of ways to administer caps and thresholds, including, but not limited to, assigning a cap and/or threshold to each port, to each client/array port pair, or to each client/array LUN pair.

[0028]FIG. 3 illustrates an embodiment of the present invention that bases the threshold and cap on a port. When multiple I/O paths cross the same piece of hardware (e.g. an array port), contention may occur. The path between each client and the storage disk array (350) is considered a separate I/O path here. By allowing one I/O path (non-priority) to be throttled or capped above a certain performance level, the other I/O path (priority) can be allowed to use the extra I/O on the Port. Thus, utilization of the I/O path is optimized. This concept can be expanded out to a very large Storage Network, but can get difficult to manage. By grouping servers into priority categories (Groups), a single setting can be made to all servers in a category automatically.

[0029] In FIG. 3, the servers (300, 310) have been grouped into priority groups: priority servers (300) and ordinary servers (310). Each group has a number of HBAs (315) connecting the respective groups to the fibre channel loop (320) leading to a switch (330). Each switch (330) leads to a port (340, 345) for each server group. Port 1 (340) is dedicated to the group of ordinary servers (310) and port 2 (345) is dedicated to the priority servers (300).

[0030] By providing independent ports to each respective group of clients, the resource manager (360) can allocate bandwidth resources to each port in proportion to the importance of the corresponding client group. By assigning a cap and a threshold quantity to each port, the bandwidth can be efficiently distributed. By way of example only, if logic unit (355) of the storage disk array (350) can only handle 7,000 input and output operations per second (IOPS), port 1(340) may be capped at 2,000 IOPS. By setting the cap at 2,000 IOPS for port 1 (340), all the servers attached to port 2 (345) can access the remaining 5,000 IOPS associated with the logic unit (355). Accordingly, port 2 may be capped at 5,000 IOPS. By assigning a threshold equal to the aforementioned cap at port 1 (340) and port 2 (345), the bandwidth resources can be dynamically managed. If, by way of example only, port 1 (340) had a threshold of 2,000 IOPS and activity at port 1 (340) drops below that threshold, the cap assigned to port 2 (345) is subsequently released allowing the servers (300) associated with port 2 (345) to make use of the unused bandwidth.

[0031] The embodiment demonstrated in FIG. 3 may also be implemented using a single high priority server and a single regular priority server. In this embodiment, a single port (340, 345) is associated with each server. A cap may be implemented on one or both servers through the corresponding port (340, 345) with the residual bandwidth available to the server associated with the other port. Just as indicated above, a threshold may be implemented on either or both of the ports (340, 345). If the activity at one of the ports (340, 345) drops below the threshold assigned it, the assigned cap at the other port is subsequently released to allow the second server make use of the unused bandwidth.

[0032]FIG. 4 illustrates how a cap and threshold may be based upon a client/array port pair. As shown in FIG. 4, clients with different priorities (400, 410) may be commonly linked to the various ports (440, 445) of the storage disk array (450) rather than grouped as in FIG. 3. By virtue of the unique identifier WWN associated with each client, the resource manager (460) can identify which clients (400, 410) are high priority clients (400) and which ones are not (410). By recognizing which HBA (415) is associated with which client (400, 410), the resource manager (460) can place caps and/or thresholds at each port for specific clients.

[0033] By way of example only, FIG. 4 illustrates the client/array port pair embodiment of the present invention. If port 1 (440) is “capped” at 500 IOPS for HBAs 1, 3, and 5 only (415), and Port 1 (440) has threshold of 2,000 IOPS then if total activity on Port 1 (440) drops below 2,000 IOPs, the caps for HBAs 1, 3, and 5 (415) are released. If Port 2 (445) is “capped” at 500 IOPS for HBAs 1, 3, and 5 (415) only and Port 2 (445) has threshold of 3,000 IOPS, then if total activity on Port 2 (445) drops below 3,000 IOPs, the caps for HBAs 1, 3, and 5 (415) are released.

[0034] Similar to the client/array port pair embodiment of the present invention explained above, the client/array LUN pair embodiment of the present invention uses the resource manager (460) to identify the client (400, 410) requesting bandwidth performance and applying corresponding caps and thresholds to the individual logic unit (455) in the storage disk array (450) rather than the ports (440, 445). The dynamic management of the bandwidth resources is still triggered by a drop in activity below the threshold. However, in this embodiment, the activity is measured at the individual logic unit (455).

[0035] This invention allows for a fine level of user control over the threshold settings. By setting a threshold, the array can relax performance caps when they are not needed and, thereby, not unduly restrict the bandwidth available to the capped port, client/array port pairs, or host/LUN pairs. When all of the bandwidth resources are being used, they are distributed according to priority designations. If, however, activity drops below threshold values, caps for each group may be released to allow for a dynamic redistribution of the available resources. The settings can be made in either I/O per second or MB per second.

[0036] Often computer systems have periodic spurts of activity or schedules on which they operate. Events such as nightly backups or daytime merchant hours affect the quantity of I/O traffic and the required quality of service. In order to maximize the efficiency of bandwidth resources, the present invention allows the caps and threshold to be time dependant corresponding to predicted spurts of activities.

[0037] In an additional embodiment of the present invention is demonstrated in FIG. 5. As shown in FIG. 5, the priority of a client may be determined by its application performance requirements. Using the command line interface or script API of a storage management software application, a user application can interrogate various storage arrays and add its connectivity port to an array performance group which meets the application's transaction bandwidth.

[0038] As demonstrated in FIG. 5, this embodiment allows for the dynamic grouping of host applications by acquiring information on available performance groups within the storage array.

[0039] Initially, the user application evaluates an established group (610). The user application first determines if the performance cap of the first established group is greater than or equal to the application's bandwidth requirement (620). If it is, there is sufficient room in the established group to include the desired application. In that case, the port associated with the application is added to the first established group (630). If, however, the performance cap of the first established group is less than the application bandwidth requirement, the user application determines if there are additional groups (640).

[0040] If there are additional groups, the application again evaluates the cap (620) to see if there is sufficient bandwidth to perform the desired application. If all of the groups have been considered and none meet the application bandwidth requirements, the application is added to the group with the highest cap (650).

[0041] The embodiment of the present invention disclosed above allows for dynamic grouping of server priorities. In this manner the groups may be arranged so as to utilize the maximum bandwidth available.

[0042] Using the command line interface, script API or Web-based Graphical User Interface (GUI) of a storage management software application, a user application can also dynamically add connectivity bandwidth and increase the performance capability of a host application by controlling multiple connectivity paths between the host computer and the storage array.

[0043] When a host application indicates a desire for increased performance capability, the resource manager can then increase available bandwidth accordingly by dedicating additional ports to the host application. This embodiment of the present invention can then be tied into billing applications for demand-based performance such as pay for performance applications.

[0044] The preceding description has been presented only to illustrate and describe the invention. It is not intended to be exhaustive or to limit the invention to any precise form disclosed. Many modifications and variations are possible in light of the above teaching.

[0045] The foregoing embodiments were chosen and described in order to illustrate principles of the invention and its practical applications. The preceding description is intended to enable others skilled in the art to best utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims. 

What is claimed is:
 1. A method for managing bandwidth allocation in a storage area network comprising: receiving a plurality of Input/Output (I/O) requests from a plurality of client devices; determining a priority of each of said client devices relative to other client devices; and dynamically allocating bandwidth resources to each said client device based on the priority assigned to that client device.
 2. The method of claim 1, wherein dynamically allocating bandwidth resources to said plurality of client devices comprises: setting an upper limit on an amount of bandwidth allocated to a priority of said client devices; and setting a lower threshold for said priority of client devices.
 3. The method of claim 2, further comprising applying said limit and threshold at ports of a storage device of said storage area network.
 4. The method of claim 3, further comprising connecting only clients of a same priority to a particular port of said storage device.
 5. The method of claim 2, wherein said limit and threshold are applied at a logical unit of a storage device of said storage area network.
 6. The method of claim 2, wherein, if I/O requests for one or more client devices drop below the corresponding lower threshold, the upper limit for one or more other client devices is released.
 7. The method of claim 1, wherein determining a priority for each of said client devices comprises receiving an indication from a user of said priority for each of said client devices.
 8. The method of claim 7, wherein said priority levels each require a different fee corresponding to said bandwidth allocation.
 9. The method of claim 1, wherein determining said priority for each of said client devices further comprises: obtaining information on available performance groups within a storage array; determining if a performance cap on a performance group within said storage array prevents the addition of one of said I/O requests; and if said performance cap does not prevent addition of one of said I/O requests, adding I/O requests from that client to said performance group.
 10. The method of claim 1, wherein determining priority of each of said client devices comprises grouping said plurality of client devices into groups according to I/O performance requirements for said client devices.
 11. The method of claim 10, wherein each client device in one of said groups receives a same bandwidth allocation setting as other client devices in that group.
 12. A storage area network comprising: a plurality of client devices; and a data storage system communicating with said plurality of client devices; wherein said data storage system dynamically allocates bandwidth resources to said plurality of client devices based on a priority assigned to each said client device.
 13. A storage area network according to claim 12, further comprising a resource manager of said data storage system for allocating said bandwidth resources among said client device, wherein said resource manager: sets an upper limit on an amount of bandwidth allocated to each of said client devices; and sets a lower threshold on an amount of bandwidth allocated to each of said client devices; wherein if said bandwidth resources used by one or more client devices are lower than a corresponding lower threshold, upper limits for one or more other client devices are released.
 14. A storage area network according to claim 13, wherein said upper limit is substantially equal to said lower threshold value.
 15. A storage area network according to claim 12, wherein said data storage system comprises a disk array.
 16. A storage area network according to claim 12, further comprising a fibre channel loop connected between said plurality of client devices and said data storage system.
 17. A storage area network according to claim 12, wherein said plurality of client devices comprise a network server.
 18. A storage area network according to claim 12, wherein said plurality of client devices are divided into groups according to said priority, wherein a group of client devices all receive a same bandwidth allocation setting.
 19. A storage area network according to claim 18, further comprising a plurality of ports of said data storage system, wherein all client devices of one of said groups are connected to said data storage system through a single one of said ports.
 20. A storage area network according to claim 18, wherein said priority levels each require a different fee corresponding to said bandwidth allocation.
 21. A storage area network according to claim 13, wherein said upper limit and lower threshold are measured in Megabytes per second.
 22. A storage area network according to claim 13, wherein said upper limit and lower threshold are measured in input and output operations per second.
 22. A storage area network according to claim 13, wherein either or both of said upper limit and lower threshold vary with time.
 23. A system for managing bandwidth allocation in a storage area network comprising: means for receiving a plurality of Input/Output (I/O) requests from a plurality of client devices; means for determining a priority of each of said client devices relative to other client devices; and means for dynamically allocating bandwidth resources to each said client device based on the priority assigned to that client device.
 24. The system of claim 23, wherein said means for dynamically allocating bandwidth resources to said plurality of client devices comprises: means for setting an upper limit on an amount of bandwidth allocated to a priority of said client devices; and means for setting a lower threshold for said priority of client devices.
 25. The system of claim 24, further comprising means for applying said limit and threshold at ports of a storage device of said storage area network.
 26. The system of claim 25, further comprising means for connecting only clients of a same priority to a particular port of said storage device.
 27. The system of claim 24, further comprising means for applying said limit and threshold at a logical unit of a storage device of said storage area network.
 28. The system of claim 24, further comprising means for releasing the upper limit for one or more client devices, if I/O requests for one or more other client devices drop below the corresponding lower threshold.
 29. The system of claim 23, wherein said means for determining a priority for each of said client devices comprise user interface means for receiving an indication from a user of said priority for each of said client devices.
 30. The system of claim 23, wherein said means for determining said priority for each of said client devices further comprises: means for obtaining information on available performance groups within a storage array; means for determining if a performance cap on a performance group within said storage array prevents the addition of one of said I/O requests; and if said performance cap does not prevent addition of one of said I/O requests, means for adding I/O requests from that client to said performance group.
 31. The system of claim 23, wherein said means for determining priority of each of said client devices comprise means for grouping said plurality of client devices into groups according to I/O performance requirements for said client devices.
 32. The system of claim 31, wherein each client device in one of said groups receives a same bandwidth allocation setting as other client devices in that group.
 33. Computer-readable instructions stored on a medium for storing computer-readable instructions, said instructions, when executed, causing a resource manager in a data storage system to: receive a plurality of Input/Output (I/O) requests from a plurality of client devices; determine a priority of each of said client devices relative to other client devices; and dynamically allocate bandwidth resources to each said client device based on the priority assigned to that client device.
 34. The computer-readable instructions of claim 33, wherein said instructions further cause said resource manager to dynamically allocate bandwidth resources to said plurality of client devices by: setting an upper limit on an amount of bandwidth allocated to a priority of said client devices; and setting a lower threshold for said priority of client devices.
 35. The computer-readable instructions of claim 34, wherein said instructions further cause said resource manager to dynamically allocate bandwidth resources to said plurality of client devices by applying said limit and threshold at ports of a storage device of said storage area network.
 36. The computer-readable instructions of claim 34, wherein said instructions further cause said resource manager to dynamically allocate bandwidth resources to said plurality of client devices by applying said limit and threshold at a logical unit of a storage device of said storage area network.
 37. The computer-readable instructions of claim 34, wherein said instructions further cause said resource manager to dynamically allocate bandwidth resources to said plurality of client devices by releasing the upper limit for one or more other client devices, if I/O requests for one or more client devices drop below the corresponding lower threshold.
 38. The computer-readable instructions of claim 33, further comprising a user interface for receiving an indication from a user of said priority for each of said client devices.
 39. The computer-readable instructions of claim 33, further causing said resource manager to: obtain information on available performance groups within a storage array; determine if a performance cap on a performance group within said storage array prevents the addition of one of said I/O requests from a particular client; and if said performance cap does not prevent addition of one of said I/O requests, add I/O requests from that client to said performance group. 